NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

https://doi.org/10.4230/lipics.ecrts.2025.21

Bakita, Joshua; Anderson, James H (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Mancuso, Renato (Ed.)
As GPU-using tasks become more common in embedded, safety-critical systems, efficiency demands necessitate sharing a single GPU among multiple tasks. Unfortunately, existing ways to schedule multiple tasks onto a GPU often either result in a loss of ability to meet deadlines, or a loss of efficiency. In this work, we develop a system-level spatial compute partitioning mechanism for NVIDIA GPUs and demonstrate that it can be used to execute tasks efficiently without compromising timing predictability. Our tool, called nvtaskset, supports composable systems by not requiring task, driver, or hardware modifications. In our evaluation, we demonstrate sub-1-μs overheads, stronger partition enforcement, and finer-granularity partitioning when using our mechanism instead of NVIDIA’s Multi-Process Service (MPS) or Multi-instance GPU (MiG) features.
more » « less
Full Text Available
Concurrent FFT Execution on GPUs in Real-Time

https://doi.org/10.1109/PDP66500.2025.00029

Ali, Syed W; Goh, Joseph; Bakita, Joshua; Chakraborty, Samarjit; Anderson, James H (March 2025, IEEE)

Free, publicly-accessible full text available March 12, 2026
Work in Progress: Increasing Schedulability via on-GPU Scheduling

https://doi.org/10.1109/RTAS65571.2025.00016

Bakita, Joshua; Anderson, James H (May 2025, IEEE)

Free, publicly-accessible full text available May 6, 2026
Demystifying NVIDIA GPU Internals to Enable Reliable GPU Management

https://doi.org/10.1109/RTAS61025.2024.00031

Bakita, Joshua; Anderson, James H (May 2024, IEEE)

Full Text Available
Demystifying NVIDIA GPU Internals to Enable Reliable GPU Management

Bakita, Joshua; Anderson, James H (May 2024, Proceedings of the 30th IEEE Real-Time and Embedded Technology and Appli cations Symposium)

Full Text Available
Hardware Compute Partitioning on NVIDIA GPUs*

https://doi.org/10.1109/RTAS58335.2023.00012

Bakita, Joshua; Anderson, James H. (May 2023, Proceedings of the 29th IEEE Real-Time and Embedded Technology and Applications Symposium)

Embedded and autonomous systems are increasingly integrating AI/ML features, often enabled by a hardware accelerator such as a GPU. As these workloads become increasingly demanding, but size, weight, power, and cost constraints remain unyielding, ways to increase GPU capacity are an urgent need. In this work, we provide a means by which to spatially partition the computing units of NVIDIA GPUs transparently, allowing oft-idled capacity to be reclaimed via safe and effcient GPU sharing. Our approach works on any NVIDIA GPU since 2013, and can be applied via our easy-to-use, user-space library titled libsmctrl. We back the design of our system with deep investigations into the hardware scheduling pipeline of NVIDIA GPUs. We provide guidelines for the use of our system, and demonstrate it via an object detection case study using YOLOv2.
more » « less
Full Text Available
Enabling GPU Memory Oversubscription via Transparent Paging to an NVMe SSD

https://doi.org/10.1109/RTSS55097.2022.00039

Bakita, Joshua; Anderson, James H. (December 2022, roceedings of the 43rd IEEE Real-Time Systems Symposium)

Full Text Available
Minimizing DAG Utilization by Exploiting SMT

https://doi.org/10.1109/RTAS54340.2022.00029

Osborne, Sims Hill; Bakita, Joshua; Chen, Jingyuan; Yandrofski, Tyler; Anderson, James H. (May 2022, Proceedings of the 28th IEEE Real-Time and Embedded Technology and Applications Symposium)

Full Text Available
TimeWall: Enabling Time Partitioning for Real-Time Multicore+Accelerator Platforms

https://doi.org/10.1109/RTSS52674.2021.00048

Amert, Tanya; Tong, Zelin; Voronov, Sergey; Bakita, Joshua; Smith, F. Donelson; Anderson, James H. (December 2021, Proceedings of the 42nd IEEE Real-Time Systems Symposium)

Full Text Available
Simultaneous Multithreading in Mixed-Criticality Real-Time Systems

https://doi.org/10.1109/RTAS52030.2021.00030

Bakita, Joshua; Ahmed, Shareef; Osborne, Sims Hill; Tang, Stephen; Chen, Jingyuan; Smith, F. Donelson; Anderson, James H. (May 2021, Proceedings of the 27th IEEE Real-Time and Embedded Technology and Applications Symposium)

Full Text Available

« Prev Next »

Search for: All records